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[57] ABSTRACT 

A method and apparatus for analyzing and synthesizing 
speech includes a programmable^lattice-iadder notch filter 
which may be programmed to exhibit both filter poles and 
filter zeros and thereby exhibit a power spectral density with 
a better fit to that of a speech frame such that, when 
energized by a selected signal sample, a more accurate 
regeneration of speech is achieved. The filter parameters 
may be reliably and systematically determined as a single 
solution to a mathematical analysis given a set of gain 
parameters matching the observed covariance data and hav- 
ing a prescribed set of transmission zeros. These transmis- 
sion zeros may either be preselected as a design 
specification, or recovered from analysis of the speech data. 
A speech frame may be analyzed and provide a set of 
parameters which may be transmitted to a remote location 
where a synthesizer may accurately reproduce the speech 
frame. 

32 Claims, 3 Drawing Sheets 



1. CORRELATION 
COMPUTATION 



5. EXCITATION 
SIGNAL 
SELECTION 



2. ZERO 
COMPUTATION 



3. LLN 
FILTER 
ANALYSIS 



1. INITIAL 
CONDITION 



CODING PART 



DECODING PART 







I 


6. EXCITATION 
GENERATOR 




7. LLN 
FILTER 







11/06/2003, EAST Version: 1.4.1 



5,940,791 

Page 2 



OTHER PUBLICATIONS 

TT. Georgiou, Realization of Power Spectra from Partial 
Covariance Sequences, IEEE Trans, on Acoustics, Speech 
and Signal Processing, vol. ASSP-35, No. 4 (1987) pp. 
438-^49. 

C.I. Byrnes, H.J. Landau and A. Lindquist, On the we 11- 
posedness of the rational convariance extension problem in 
Current and Future Directions in Applied Mathematics, M. 
Alber, B. Hu and J. Rosenthal (editors), Birkhauser Boston 
(1997), pp. 83-108. 

C.I. Byrnes and A. Lindquist, An algebraic description of the 
rational solutions of the covariance extension problem, in 
Linear Circuits, Systems and Signal Processing, C.I. Byrnes, 
C.F. Martin and R.E. Saeks (editors), Elsevier (1988), pp. 
9-17. 

C.I. Byrnes and A. Lindquist, On the geometry of the 
Kimura-Georgiou parameterization of modelling filter, 
Inter. J. of Control 50 (1989) pp. 2301-2312. 
C.I. Byrnes and A. Lindquist, Toward a solution of the 
minimal partial stochastic realization problem, Comptes 
Rendus Acad. Sci. Paris, t. 319 (1989) pp. 2301-2312. 
C.I. Byrnes and A Lindquist, Some recent advances on the 
rational covariance extension problem, Proc. IEEE Euro- 
pean Workshop on Computer Intensive Methods in Control 
and Signal Processing, Prague (1994), pp. 149-158. 
C.I. Byrnes and A. Lindquist, On the partial stochastic 
realization problem, IEEE Trans. Automatic Control AC-42 
(1997), to appear. 

C.I. Byrnes and A. Lindquist, On a duality between filtering 
and interpolation, System and Control in the 21st Century, 
pp. 101-136. 

C.I. Byrnes, A. Lindquist, S.V Gusev and A.V. Matveeve, 
The geometry and positive real functions with applications 
to the rational covariance extension problem, Proc. 33rd 
Conf. on Decision and Control, 3883-3888. 
C.I. Byrnes, A. Lindquist, S.V. Gusev and A.V. Matveeve, A 
complete parameterization of all positive rational extensions 
of a covariance sequence, IEEE Trans. Automatic Control 
AC-40 (1995), pp. 1841-1857. 

C.I. Byrnes, A Lindquist, and T McGregor, Predictability 
and unpredictability in Kalman filtering, IEEE Transactions 
Auto. Control AC-36 (1991), pp. 563-579. 



C.I. Byrnes, A Lindquist, and Y. Zhou, Stable, unstable and 
center manifolds for fast filtering algorithms, Modeling, 
Estimation and Control of Systems with Uncertainty (G.B. 
Di Masi, A. Gombani, and A. Kurzhanski, eds.), Birkhauser 
Boston, Inc. (1991). 

C. I. Byrnes, A. Lindquist, and Y. Zhou, On the nonlinear 
dynamics of fast filtering algorthms, SLAM J. Control and 
Optimization, 32 (1994) pp. 744-789. 

L.O. Chau, C.A. Desoer and E.S. Kuh, Linear and Nonlin- 
ear Circuits, McGraw-Hill (1989). 
G. Heinig, P. Jankowski and K. Rost, Fast Inversion Algo- 
rithms of Toeplitz-plus-Hankel Matrices Numerische Math- 
cmatik 52 (1988)j pp. 665-682. 

W.B. Kleijn, D.J. Krasinski and R.H. Ketchum, Fast meth- 
ods for the CELP speech coding algorithm, IEEE Trans. 
Acoustics, Speech, and Signal Processing 38 (1990), pp. 
1330-1332. 

D. G. Luenberger, Linear and Nonlinear Programming (Sec- 
ond Edition), Addison-Wesley Publishing Company, Read- 
ing, Mass. (1984). 

J.D. Markel and A.H. Gray, Linear Prediction of Speech, 
Springer Verlag, Berlin (1976). 

M. Minoux, Mathematical Programming: Theory and Algo- 
rithms, John Wiley and Sons, New York (1986), pp. 94-95. 
B. Porat, Digital Processing of Random Signals, Prentice 
Hall (1994). 

D. Qu&rmby, Signal Processing Chips, Prentice Hall (1994). 
L.R. Rabiner, B.S. Atal, and J.L. Flanagan, Current methods 
of digital speech processing, Selected Topics in Signal 
Process (S. Haykin, editor), Prentice Hall (1989), pp. 
112-132. 

L.R. Rabiner and R.W. Schafer, Digital Processing of 
Speech Signals, Prentice Hall, Engelwood Cliffs, N.J. 
(1978). 

Vostry, Zdnek, New Algorithm For Polynomial Spectral 
Factorization with Quadratic Convergence I, Kybernetica 
77 (1975). 

TR45 Full-Rate Speech Codec Compatibility Standard 
PN-2972, Electroic Industries Association, 2001 Eye Street 
N.W., Washington, D.C. 20006 (1990). 
F.J. Taylor, Digital Filter Design Handbook, Prentice Hall, 
N.J. (1994) pp. 340-347. 



11/06/2003, EAST version: 1.4.1 



U.S. Patent Aug. 17, 1999 Sheet 1 of 3 5,940,791 




FIG . 1 

PRIOR ART 



10 | 1 r 




I 1 1 1 I 

0 0.2 0.4 0.6 0.8 

FREQUENCY 

FIG . 2 

PRIOR ART 



10 




-60 I 1 1 1 i 

0 0.2 0.4 0.6 0.8 

FREQUENCY 

FIG . 3 



11/06/2003, EAST Version: 1.4.1 



U.S. Patent 



Aug. 17, 1999 



Sheet 2 of 3 



5,940,791 




r — 






— u 


O m ■ 




FIG . 4 



A/D 



1. CORRELATION 
COMPUTATION 



5. EXCITATION 
SIGNAL 
SELECTION 



3. LLN 
FILTER 
ANALYSIS 



CODING PART 



_L 



DECOOING PART 



6. EXCITATION 
GENERATOR 



7. LLN 
FILTER 



2. ZERO 
COMPUTATION 



1. INITIAL 
CONDITION 



0/A 



FIG . 5 



11/06/2003, EAST Version: 1.4.1 



U.S. Patent Aug. 17, 1999 Sheet 3 of 3 



INITIAL SOLUTION 
FROM BLOCK 4 



A( z ) 



B( z ) 



CALCULATION 
OF COVARIANCE 
ESTIMATES C 



± 



<(l c-cio 



NO 



A( z ).B( Z ) 



YES 



i 



DETERMINING 
FILTER PARAMETERS 



AC z ) 



UPDATING 
A( z ) 



qo«°r .°n-l 



FIG . 6 



PULSE 




GENERATOR 





NOISE 


7 


GENERATOR 






F IG . 7 



11/06/2003, EAST version: 1.4.1 



5,940,791 



10 



15 



METHOD AND APPARATUS FOR SPEECH 
ANALYSIS AND SYNTHESIS USING 
LATTICE LADDER NOTCH FILTERS 

BACKGROUND OF THE INVENTION 

In speech synthesis, the speech pattern is usually divided 
into frames of a few tens of ms. Conventional vocoders 
identify stationary speech phonemes into voiced and 
unvoiced speech (see, e.g., U.S. Pat. Nos. 2,151,091 and 
2,243,526). In the art, a single phoneme is regarded to 
evolve on the order of magnitude of 100 ms. On each of the 
smaller windows, a linear filter — or digital circuit — is then 
designed to model the vocal tract. A voiced speech signal 
can be modeled, and regenerated,, as a deterministic process 
obtained by passing a ^quasi-periodic signal containing the 
appropriate pitch frequencies through a linear filter. 
Analogously, an unvoiced speech signal is modeled, and 
regenerated, as a random signal obtained by passing white 
noise through the same linear filter, which models the vocal 
chords. In this time frame, the parameters characterizing the 
linear filter as an input/output device are identified using, for 
example, methods from linear prediction coding (LPC) filter 
design, and encoded for regeneration. For applications in 
cellular telephone communications using pitch-excited 
vocoders, in this same window, the speech pattern is seg- 
mented into an identified sequence, also encoded for 
regeneration, of voiced and unvoiced phonemes. In some 
popular forms of vocoders, for each unvoiced expression a 
code book, or look-up table, of white noise signals is 
searched for that signal which, when passed through the 
LPC filter, regenerated the response closest to the sampled 
unvoiced signal. The code for this signal is then transmitted 
for regeneration. A similar procedure is performed for 
voiced signals with^a_periodic pulse train signals in lieu of 
white noise. Here, however, the vocoder must also perform 
pitch detection m orderjo.regenerate the voiced signal. 

(Linear Predictive Coding (LPC) can 1 be used in a variety 
of different speech coders, such as pitch-excited vocoders, 
voice-excited vocoders, waveform coders, analysis-by- 
synthesis coders, and frequency-domain coders (see T. P. 
Barnwell III, K. Nayebi and C. H. Richardson, Speech 
Coding: A Computer Laboratory Textbook, John Wiley & 
Sons, New York, 1996, at 85), and the invention disclosed 
herein can be used in all these contexts and is not confined 
to a particular vocoder architecture. In fact, LPC filters, 
sometimes referred to as maximum entropy filters, in 
devices for such digital signal processing and speech syn- 
thesis (see, e.g., U.S. Pat. Nos. 4,209,836 and 5,048,088; D. 
Quarmby, Signal Processing Chips, Prentice Hall, 1994; and 
L. R. Rabiner, B. S. Atal, and J. L. Flanagan, Current 
Methods of Digital Speech Processing, Selected Topics in 
Signal Processing, S. Haykin, editor, Prentice Hall, 1989, 
112-32) have been used in the prior art. 

The circuit, or integrated circuit device, whiclf imple-? 55 
^ments the LPC filter is designed and fabricatedlising ordi- 
nary skill in the art of electronics (see, e.g., U.S. Pat. Nos. 
4,209,836 and 5,048,088) on the basis of the specified 
parameters (specs) which appear as coefficients (linear pre- 
diction coefficients) Jn the mathematical description go 
(transfer function) (of^ the LPC filter.^For example, the 
expression of the specified ~parameters_(specs) is often 
conveniently displayed in_the_lattice]fi^rrepresentation of 
the circuit shown in FIG. 1, containing unit delays z" 1 , 
summing junctions, and gains. 

This is also known as a PARCOR system. The gain 
(PARCOR) parameters, which are also the reflection coef- 
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ficients of the random signal (as in FIG. 1), are easily 
determined from the speech waveform. The design of the 
associated circuit is immediate with ordinary skill in the art 
of electronics. In fact, this filter design has been fabricated 
by Texas Instruments, starting from the lattice filter 
representation, and is used in the LPC speech synthesizer 
chips TMS 5100, 5200, 5220 (see, e.g., Quarmby, Signal 
Processing Chips, supra, at 27-29). 

The two advantages of LPC filter design are that it is 
possible to find parameter specs so that the LPC filter 
produces a signal which reproduces much of the observed 
spectral properties, and that there exists algorithms for 
finding the filter parameters from the spectral properties of 
the observed speech form. FIG. 2 shows a periodogram 
determined from a frame of speech data together with the 
power spectral density of a 6th order LPC filter designed 
from this frame. 

A disadvantage of the LPC filter is that its power spectral 
density cannot match the "valleys," or "notches," in the 
periodogram and results in speech which is rather "flat," 
reflecting the fact that the LPC filter is an "all-pole model." 
This is related to the technical fact that the LPC filter only 
has poles and has no transmission zeros. To say that a filter 
has a transmission zero at a frequency £ is to say the filter, 
or corresponding circuit, will absorb damped periodic sig- 
nals which oscillate at a frequency equal to the phase of t, 
and with a damping factor equal to the modulus of This 
is the well-known blocking property of transmission zeros of 
circuits (see, e.g., L. O. Chua, C. A. Desoer and E. S. Kuh, 
Linear and Nonlinear Circuits, McGraw-Hill, 1989, at 659). 
This-technical fact is reflccted-in-the fact, illustrated in FIG. 
2, that the power spectral density of the LPC filter will not 
match the periodogram at frequencies near its notches. It is 
also widely appreciated in the signal and speech processing 
literature that regeneration of human speech requires the 
design of filters having zeros, without which the speech will 
sound flat or artificial (see, e.g., C. G. Bell, H. Fujisaki, J. M. 
Heinz, K. N. Stevens and A. S. House, Reduction of Speech 
Spectra by Analysis-by-Synthesis Techniques, J. Acoust. 
Soc. Am. 33 (1961), at 1726; J. D. Markel and A. H. Gray, 
Linear Prediction of Speech, Springer Verlag, Berlin, 1976, 
at 271-72; L. R. Rabiner and R. W. Schafer, Digital Pro- 
cessing of Speech Signals, Prentice Hall, Englewood Clifls, 
N.J., 1978, at 105 and 76-78). Indeed, while all-pole filters 
can reproduce much of human speech sounds, the acoustic 
theory teaches that nasals and fricatives require both zeros 
and poles (see, Markel et al., Linear Prediction of Speech, 
supra, at 271-72; Rabiner et al., Digital Processing of 
Speech Signals, supra, at 105). 

This observation is a partial motivation for the device 
disclosed in U.S. Pat. No. 5,293,448, in which a zero filter 
is used as a prefilter to the all pole filter to generate higher 
quality voiced signals. However, the lack of a clear and 
useful delineation of the extent to which zeros may be 
arbitrarily assigned to implementable for linear filters both 
voiced and unvoiced speech has remained a limiting factor 
in the design of improved devices for signal and speech 
processing. 

SUMMARY OF THE INVENTION 

The present invention comprises a method and device for 
the systematic design of linear filters having both desired 
zero structure and the standard spectral features required to 
regenerate the appropriate voiced and unvoiced signals. The 
inventors' implementations and tests suggest that the pre- 
selection of nontriviaJ zero structure improves the regenera- 
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tion of human speech. Applying this methodology for filter This method and design has application in speech syn- 
design to the same speech data and periodogram as in FIG. thesis for another reason, especially in applications involv- 
2, an alternative 6th order filter is obtained, whose power ing the compression and transmission of speech as it occurs, 
spectral density is less flat and provides a better for example, in digital cellular telephone communications, 
approximation, reflecting the fact that the filter is designed 5 We have already described the use of LPC filters in speech 
to have transmission zeros near the minima of the peri- synthesis in a pitch-excited vocoder environment and noted 
odogram (see FIG. 3). the need for filters, such-as LLN filters, which both have 
In automatic control, filters having prescribed transmis- desired transmission zeros and match the covariances deter- 
sion zeros are referred to as notch filters. Notch filters have mined from speech data. LLN filters will enjoy this advan- 
been used as dynamic compensators to attenuate stable 30 ta S e in use in a variety of other vocoders, such as voice- 
harmonic fluctuations by achieving desired pole-zero can- excited vocoders, waveform vocoders, analysis-by- 
cellations. Although the desirability of notches in the power synthesis vocoders and frequency-domain vocoders. In 
spectrum of the filters disclosed in the present invention addl i lon > for s Pf ch ^thesis using one of the analysis-by- 
plays a quite different role for signal processing and speech ^ e * is vocod , ei ?' a ^ode-excited linear predictor coder 
syndesis than the role played by notch filters in automatic 15 fi^ ™ 
control the analogy supsts that we refer to the filters devke for ^ M wm be$t * 
disclosed here as LLN filters This acronym also reflects the erate the observed h rf { The identification ^ 
lattice-ladder architecture which can be used to implement or code number> of this signal fe ^ Emitted along with 
these filters, as disclosed below. Lattice-ladder architecture a faclor and the LPC gains t0 the rece ivi n g device for 
representations of various filters occur in prior art, but the 20 regeneration. Currently the most computational, and time, 
lack of either a method or a means of tuning the filter intensive activity during this window is the search through 
parameters in this representation to match the desired spec- look-up tables to find the best signal with which to reproduce 
tral properties and the notches in the spectrum of the speech the observed signal using an LPC filter. By using a mare 
waveform has prohibited its use in speech processing for accurate spectral envelope, through the method disclosed 
high quality speech analysis and synthesis. Such method and 2 s herein, for the spectrum of the observed speech waveform, 
means are disclosed herein. We note that an LLN filter is it is expected to be able to reduce the number of candidate 
defined, independent of the architecture used to represent it, signals with which to reproduce the observed signal, thereby 
by the specification of the filter zeros and of the window of shortening the most time-intensive part of the encoding and 
covariances computed from the speech frame. The lattice- speech compression activity. 

ladder architecture is the implementation of one particular 30 It is expected that this invention will have application as 

state-space representation of the LLN filter transfer function, an alternative for the use of LPC filter design in other areas 

and the theory of linear systems teaches that each such state of signal processing and statistical prediction including, for 

space representation (which when minimal dimension are all example, speech compression, speech synthesis from stored 

equivalent under a state-space transformation) Linear coded speech, or text-to-speech synthesis. In each of these 

Systems, T. Kailath, Prentice-Hall, NJ. 1980 (pages 35 areas, the same fundamental spectral estimation problem 

363-365) leads directly to an equivalent architecture repre- arises. 

sented in terms of gains, summing junctions and unit delays While the principal advantages and features of the present 

(see e.g. Kailath, pages 93-94). For purposes of illustrating invention have been briefly explained and illustrated in the 

the preferred embodiment, we teach the implementation of foregoing discussion, a greater understanding and apprecia- 

LLN filters in terms of the lattice ladder architecture. It is 40 tion for the various aspects of the invention may be attained 

also disclosed here that LPC filters, and their representation by referring to the drawings and the description of the 

in a lattice filter architecture, can be obtained as a special- preferred embodiment which follow, 

ization of the disclosed LLN filter design. _ _ 

Tt . . . , . # . - # . - . * . . BRIEF DESCRIPTION OF THE DRAWINGS 

It is widely appreciated in the signal and speech process- 

ing community that regeneration of human speech requires 45 FIG - 1 & a schematic diagram of a lattice representation 

the design of filters having zeros. The effect of zeros is to of m LPC filter ^ known in the prior art; 

enable the power spectral density to provide a spectral FIG. 2 is a periodogram determined from a frame of 

envelope which better fits the speech waveform. For speech data together with the power spectral density of a 

example, this enables the design of a filter which is sensitive sixth order LPC filter designed from this frame; 

to the location of the source of the speech within the vocal 50 FIG. 3 is the periodogram of FIG. 2 with the power 

tract as well as providing a model encompassing the nasal spectral density of an alternative sixth order filter of the 

tract (see, Bell et al., Reduction of Speech Spectra by present invention; 

Analysis-by-Synthesis Techniques, supra, at 1726-27; and FIG. 4 is a schematic diagram of a lattice- ladder repre- 
Markel et al., Linear Prediction of Speech, supra, at sentation the present invention (LLN filter); 
271-72). For such reasons, the method and device for the 55 FIG. 5 is a schematic diagram of a speech coder corn- 
systematic design of linear filters having both desired zero prising a speech analyzer portion and a speech synthesis 
structure and the standard spectral features required to portion; 

regenerate the appropriate voiced and unvoiced signals has FIG. 6 is a block diagram, or flow chart, of the math- 

apphcation m the area of high quality speech synthesis. ematical process required to determine the LLN filter param- 

For the same reasons, this method and design may be used 60 etenv and 

in devices for speech recognition, by providing a device for FI a 7 is a schematic diagram of an excitation generator 

both developing filter parameters from observed speech and for a p i tc h-excited vocoder, 
comparing the filter parameters so-obtained with parameters 

already stored for particular individuals. The LLN filter DETAILED DESCRIPTION OF THE 

design disclosed herein uses zeros as well as poles, while the 65 PREFERRED EMBODIMENT 

LPC filter is an all-pole filter, and should provide more Conventional vocoders perform three tasks: determining a 

accurate encoding and identification of human speech. linear filter which models the vocal tract, determining an 
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appropriate model of the excitation signal, and the regen- 
eration of the observed signal when the excitation signal is 
generated and used to excite the linear filter. Naturally, tasks 
such as speech compression and transmission are accom- 
plished by encoding and decoding the filter parameters and 
the identifying characteristics of the excitation signal within 
a class of signals. As discussed above, there are standard 
electronic devices which perform each of these three tasks, 
both separately and in an integrated form, in a variety of 
vocoder architectures. Also discussed is the fact that the LPC 
filter is the most commonly used linear filter in the prior art 
for modeling the vocal tract. The basic design feature of such 
a filter is that it produces a synthetic speech signal whose 
power spectrum maLches the first n+1 co variance coeffi- 
cients c 0 , c A , C 2 , . . . , C„ extracted from the speech data in 
the window; i.e., besides being stable, the transfer function 
w(z) of the LPC filter satisfies 

|gi(c'' 0 )| 2 -V2* 1 cos8+2£ 2 cos 26+ ... , (2.1) 

where 



£ A -c 4 for Jb-0,1, . . . , n (2.2) 

(For example, for unvoiced speech, this means that the LPC 
circuit produces a random signal, with specified first n+1 
covariances c 0 , C 1 , . . . , C rt , when driven by white noise.) 
It is this property which makes the LPC filter a decent model 
of the vocal tract. 

However, as pointed out above, the drawback of the LPC 
filter is that it is an all -pole filter. Atal has pointed out that 
the perceived differences between real speech and the best 
synthetic speech obtainable using an LPC filter are at least 
partially due to the all-pole model restriction (see, Markel et 
al., Linear Prediction of Speech, supra, at 271). The need for 
introducing zeros has led to many ad hoc methods, but the 
absence of a systematic design methodology for incorporat- 
ing appropriate zeros is a limiting factor in high quality 
speech analysis and synthesis (see, e.g., Markel et aL, Linear 
Prediction of Speech, supra, at 271-275). An advantage of 
the present invention is that it includes a filter which satisfies 
the same covariance properties (2.1)-(2.2) as the LPC filter, 
but for which the zeros can be set arbitrarily. 

In fact, in this disclosure it is described how to design a 
filter, and how to implement this filter with a circuit, which 
will be useful in signal processing and speech synthesis. 
This shall be referred to as a LLN filter, because it is a notch 
filter, designed to have prescribed zeros, and can be 
implemented, for example, using a lattice-ladder 
architecture, as described in detail below. The circuit meets 
the following specifications. 

(i) For the covariance coefficients Cq, C 3 , C^, . . . , C„ 
extracted from the speech data in the window, the 
(stable) transfer function of the LLN filter satisfies 

|co(e / °)P=^ 0 +2a i cos0+2<? 2 cos 26+ ... , (2.3) 

where 



t k c k for Jfc-0,1, n. (2.4) 

(ii) The filter has prescribed zeros, ^ . . . , £„, inside the 
unit disc. 

In specifying the covariance coefficients, we also include 
equivalent forms of specifying these statistical data, such as 
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the representation of covariance data by PARCOR 
parameters, reflection coefficients, Schur parameters or their 
statistical equivalents. In specifying the filter zeros, we also 
include equivalent forms of specifying zeros, such as the 

5 coefficients of the zero polynomial, the zero polynomial or 
any of their functional equivalents. These, and any other 
equivalents, would be understood by one of ordinary skill in 
the art as included within the scope of the invention. For 
brevity, the inventors shall use the terms covariance and 

10 zero, and these terms shall be understood as being broadly 
defined to include these and other such equivalents. 

When represented in the lattice -ladder architecture, the 
LLN filter is determined by the parameters shown in FIG. 4. 
As for the lattice representation of the LPC filter, the LLN 

15 filter consists of gains, which are the parameter specs, unit 
delays z" 1 , and summing junctions and therefore can be 
easily mapped onto a custom chip or onto any program- 
mable digital signal processor (e.g., the Intel 2920, the TMS 
320, or the NEC 7720) using ordinary skill in the art (see, 

20 e.g., Quarmby, Signal Processing Chips, supra, at 27-29). 
It is observed that the lattice-ladder filter representation 
builds on the lattice filter topology, but provides for the 
incorporation of the spec parameters denoted by p, which 
allow for the incorporation of zeros into the filter design. In 

25 fact, the lattice filter representation of an all-pole (LPC) filter 
can be designed from the lattice-ladder filter architecture 
representing an LLN filter by setting the parameter specifi- 
cations: 

30 Pa**,"* Pi=P 2 = ■ • ■ "P„-0 and a k ^ k for k=0,l, . . . , n-1. 

While it has been appreciated that the additional features of 
the lattice-ladder filter make it possible to design filters with 
nontrivial zero structure, the limiting factors in this design 

35 strategy have been the lack of knowledge as to how arbi- 
trarily one could make the specifications of the filter zeros, 
how may filters (if any) achieve this design specification, 
and how to design such filters if they do indeed exist. The 
first of these questions has been successfully addressed in 

40 the literature by T. T. Georgiou, Realization of Power 
Spectra from Partial Covariance Sequences, IEEE Trans, on 
Acoustics, Speech and Signal Processing, ASSP-35 (1987) 
438-449, where it is shown that such filters exist for any 
choice of zeros (inside the unit disc). The second theoretical 

45 issue was resolved in the literature in the publication CI. 
Byrnes, A. Lindquist, S. V. Gusev and A. V. Matveev, A 
Complete Parameterization of All Positive Rational Exten- 
sions of a Covariance Sequence, IEEE Trans. Automatic 
Control AC-40 (1995), 1841-1857, where it is also shown 

50 that the choice of zeros can be made arbitrarily while 
maintaining the ability to match the covariance coefficients 
and that this can be done in only one way, so that the 
resulting filter is uniquely determined by these design speci- 
fications. These theoretical results were purely existence 

55 arguments and did not, however, resolve the issue of how to 
effectively compute the gains an implementable digital filter; 
for example, in the lattice-ladder representation we propose 
here. This left the long felt need in the art for a solution to 
this problem which would permit the design and use of an 

60 appropriate filter for speech analysis and synthesis. 

The present invention answers the third question and 
includes a method for prescribing the zeros, so that it is 
possible to design and implement the unique filter which can 
produce output signals with the desired spectral properties 

65 and a spectral envelope with the desired notches. We also 
disclose that this filter can be represented in a lattice-ladder 
architecture and that an efficient algorithm for the compu- 
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tation of the (LLN) filter parameters is disclosed along with • ■ • (z-^)^ V+ ■ • ■ +K- (2.6) 
a method for determining the filter zeros from observed data. 

In the present method, the filter poles are completely deter- So that ^ resulting filter will be implementable, the zeros 

mined by the observed correlation coefficients (i) and the are restricted to be self -conjugate, i.e., so that for each i there 

prescription of transmission zeros (ii). Also disclosed is the 5 exists a J 50 that where the bar represents complex 

fact that the filter poles cannot, in general, be prescribed conjugation. Consequently, the coefficients b ( - of the zero 

first, with the zeros then determined by the observed data. polynomial B(z) are all real numbers. 

This distinguishes the disclosed method and the resulting II 1S the standard practice, when using an all-pole filter to 

circuit design and the method and design disclosed in U.S. use additional poles in order to compensate for the lack of 

Pat. No. 5,293,448, wherein a lattice filter representation of 10 zeros ( see » Ra Wner et al., Digital Processing of Speech 

a zero filter is tuned to the prior choice of the all-pole (or Signals, supra, especially at Section 3.4.1, p. 99). The 

LPC) filter, represented for example in ladder form. Another present method is based on the capability of using an LLN 

advantage of the present invention is in the use of zeros to filler of lower order than would be required by the LPC filter 

reproduce unvoiced, as well as voiced, speech. and takin g advantage of the availability of additional auto- 

With the present method and circuit design, it is possible is correlation data to determine the zero structure by standard 

to incorporate the preselection of arbitrary transmission systems and signal processing algorithms, 

zeros into the design specifications for a (notch) filter which Usin S the Autocorrelation Method (see, Barnwell III et 

reproduces an approximation for an observed signal when al » Speech Coding: A Computer Laboratory Textbook, 

driven by the appropriate excitation signal, and there exists su P ra > at 91-93), or some version of Burg's algorithm as 

algorithms for finding the filter parameters 20 tau S ht in B - Porat ' Digital Processing of Random Signals, 

Prentice Hall, 1994, at 176, first compute the reflection 

coefficients 

etc, a u . . . , and p 0 , p l3 . . . , ft, (2.5) 



from the design specifications. 

FIG. 5 illustrates a possible speech coding environment of 
the invention. Each box in this flow chart represents an 
algorithm or computation which can be found in existing 
software packages or hardware. Disclosed below is the 
inventors' best implementation of such algorithms. 

The block diagram in FIG. 5 consists of two parts. The top 
part is the transmitter (analyzer), which computes the filter 
parameters, gain, and choice of excitation model as 
explained below. These parameters are coded, in the usual 
manner, and transmitted to the receiver (synthesizer) and 
decoded. The receiver consists of an excitation generator 
providing the appropriate input to the LNN filter (see FIG. 
4), thus set with the appropriate parameters. 

To improve the numerical properties of the filter compu- 
tation the digitized signal may be passed through a pre- 
emphasis filter to reduce the dynamical range of the speech 
spectrum (see, Barnwell III et al., Speech Coding: A Com- 
puter Laboratory Textbook, supra, at 86-87). To compensate 
for this, the inverse filter (deemphasis filter) is placed at the 
output of the LNN filter. To simplify matters, in FIG. 5, these 
filters are included in the A/D and D/A blocks, respectively, 
which themselves may be omitted if the source and result are 
desired to be digital signals. 

Determination of the Covariance Coefficients 

Box 1 determines the covariance sequence c 0 , Cj, . . . , c„ 
from the digital waveform and is standard practice using 
ordinary skill in the art. For example, the Autocorrelation 
Method can be used (see, Barnwell III et al., Speech Coding: 
A Computer Laboratory Textbook, supra, at 91-93). Also, 
U.S. Pat. No. 4,775,951 discloses a device for computing 
correlation functions and coefficients. 

Determination of the Transmission Zeros 

Box 2 determines the filter zeros from the digital 
waveform, if in fact the zeros have not been prescribed as 
part of the application specs. There are several alternatives 
for systematically choosing the zeros. First disclosed is the 
recursive algorithm which is believed to represent the best 
available state of the-art. 

It is desired to determine the set of desired transmission 
zeros, £2, . . . , £„. In fact, for present purposes it is 
actually needed to determine the corresponding zero poly- 
nomial 



Ym+l-n, Ym+2-n, . . . , ym+n 

for some m^n, i.e., the PAR COR coefficients succeeding 
those used in the LPC filter, and then solve the Toeplitz 
system 



7m 


7m-l 


" 7mtl-n 






7m+l 


7m* 1 


7m 


" 7m+2- n 


0-2 




7m*2 


. 7m+n~ 1 


7m+n-2 ' 


" 7m . 


.<r n . 




. 7m+-n . 



35 

for the parameters a lf a 2 , . - . , a„. If the polynomial 

<j(z)-f , -KI l f , " 1 + . . . +0„, 

40 

has all its roots inside the unit disc, simply set B(z)-o(z). 
More generally, take B(z) to be the stable spectral factor of 
Q(z):=a(z)oXz _1 ), computed as in Appendix A and normal- 
ized so that the leading coefficient (of z") is 1. 

45 Alternative methods can be based on any of the proce- 
dures described in Markel et al., Linear Prediction of 
Speech, supra, at 271-75, including Prony's method with 
constant term. As pointed out above, these methods are not 
by themselves good for producing synthetic speech, because 

50 they do not satisfy (2,l}-(2.2). However, here only the zero 
computation is used, and in Box 3 the poles are determined 
to satisfy (2.1)-(2.2). As another alternative, the choice of 
zeros can also be done by determining the phase of the zeros 
from the notches in the observed spectrum, as represented by 

55 a periodogram or as computed using Fast Fourier Trans- 
forms (FFT), by any standard algorithm for determining 
local minima. 

Coding of the LLN Filter Parameters 

The key in the present algorithm is to be able to system- 

60 atically determine the gain parameters (2.5) to match the 
observed covariance data and to have prescribed transmis- 
sion zeros. This is done in Box 3. 

As a result of the basic theory developed in C. I. Byrnes 
and A. Lindquist, An Algebraic Description of the Rational 

65 Solutions of the Covariance Extension Problem, Linear 
Circuits, Systems and Signal Processing, C. I. Byrnes, C. F. 
Martin and R. E. Saeks (editors), Elsevier 1988, 9-17; C. I. 
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QC^V) for all 8t [-*,*]. 



(2.10) 



The present invention shows that there is a unique such 
minimum. Given the minimizing (q 0 ,qi, . . . , qj and the 
corresponding pseudo polynomial (2.9), next determine the 
stable polynomial 



Byrnes and A. Lindquist, On the Geometry of the Kimura- 
Georgiou Parameterization of Modeling Filter, Inter. J. of 
Control 50 (1989), 2301-2312; C. I. Byrnes and A. 
Lindquist, Toward a Solution of the Minimal Partial Sto- 
chastic Realization Problem, Comptes Rendus Acad. Sci. 
Paris, t. 319, Sene I (1994), 1231-1236; C. I. Byrnes and A. 
Lindquist, Some Recent Advances on the Rational Covari- 
ance Extension Problem, Proc. IEEE European Workshop 
on Computer Intensive Methods in Control and Signal 
Processing, Prague 1994, 149-158; C. I. Byrnes and A. 
Lindquist, On the Partial Stochastic Realization Problem, 
IEEE Trans. Automatic Control AC-42 (1997); C. I. Byrnes, 
A. Lindquist, and T. McGregor, Predictability and Unpre- 
dictability in Kalman Filtering, IEEE Transactions Auto. 
Control AC-36 (1991), 563-579; C. I. Byrnes, A. Lindquist, 

and Y. Zhou, Stable, Unstable and Center Manifolds for Fast 15 which can be done by any of the procedures described in 



10 



satisfying 



(2.11) 



(2.12) 



20 



25 



30 



Filtering Algorithms, Modeling, Estimation and Control of 
Systems with Uncertainty (G. B. Di Masi, A. Gombani, and 
A. Kurzhanski, eds.), Birkhauser Boston Inc., 1991; and C. 
I. Byrnes, A. Lindquist, and Y. Zhou, On the Nonlinear 
Dynamics of Fast Filtering Algorithms, SLAM J. Control 
and Optimization, 32 (1994), 744-789, and culminating in 
Byrnes et al., A Complete Parameterization of All Positive 
Rational Extensions of a Covariance Sequence, supra, at 
1841-1857; C. I. Byrnes, A. Lindquist, S. V. Gusev and A. 
V. Matveev, The Geometry of Positive Real Functions with 
Applications to the Rational Covariance Extension Problem, 
Proc. 33rd Conf. On Decision and Control, 3883-3888; C. 
I. Byrnes and A. Lindquist, On a Duality Between Filtering 
and Interpolation, Systems and Control in the 21st Century 
(C. I. Byrnes, B. N. Datta, D. Gilliam, C. F. Martin, eds.) 
101-136; C. I. Byrnes, H. J. Landau and A. Lindquist, On 
the Well-Posedness of the Rational Covariance Extension 
Problem, Current and Future Directions in Applied 
Mathematics, M. Alber, B. Hu and J. Rosenthal (editors), 
Birkhauser Boston, 1997, 83-108; and C. I. Byrnes, S. V. 
Gusev and A. Lindquist, A Convex Optimization Approach 35 
to the Rational Covariance Extension Problem, it is known 
that for any window of correlation coefficients, as above, and 
any preselection of transmission zeros, there exists one and 
only one linear filter having both desired zero structure and 
the standard statistical features required to regenerate the 
observed random signal. The present invention comprises a 
method for systematically designing the specified param- 
eters for the linear filter which achieves this. The basis for 
this systematic design method, and the resulting device, is 
the discovery that the filter parameters can indeed be 
obtained from a standard kind of optimization procedure, 
applied to a formulation of the problem the inventors have 
discovered and now disclose. 

There are two sets of inputs to Box 3, the covariance 



Appendix A. Then the present invention shows that the 
transfer function 



w(t) = 

2A(z) 



(2.13) 



satisfies the specifications (i) and (ii) for an LLN filter. 
Finally, determine the gain parameters (2.5) in the following 
way: 

For k«n, n-1, . . . , 1, solve the recursions 



Ok-ij -a kJ + a k .ia kJl .j, a^^aj (2.14) 

<*kO 

bk-ij = by - A fl w-y. km = bj 



For j =0,1, . . . , k, and set P^+b^a^. 

Passing from the parameterization (2.13) to the param- 
eterization (2.14) is a well-known procedure (see, e.g. K. J. 
40 Astrom, Introduction to Stochastic Realization Theory, Aca- 
demic Press, 1970; and K. J. Astrom, Evaluation of Qua- 
dratic Loss Functions for Linear Systems, Fundamentals of 
Discrete-time systems: A Tribute to Professor Eliahu I. Jury, 
M. Jamshidi, M. Mansour, and B. D. O. Anderson (editors), 
IITSI Press, Albuquerque, N.M., 1993, 45-56). This algo- 
rithm is recursive, using only ordinary arithmetic operations, 
and can be implemented with an MAC mathematics pro- 
cessing chip using ordinary skill in the art. 

The minimization of (2.8) given the constraints (2.10) is 



45 



coefficients c 0 ,c lt . . . , c n , delivered by Box 1, and the set 50 a convex optimization problem for which there are many 



of transmission zeros (2.6), delivered by Box 2 and collected 
in the polynomial B(z). The third input from Box 4 may or 
may not be present, as explained below. Defining $ to be the 
objective function 



55 



#?o. qi ?„) = 



(2.8) 



60 



in the n+1 variables q 0 ,qj, . . . , q„, where 



0(r)- <?o +V^ 1 (z+z- l )+V^ 2 ( 2 2 +z- 2 ) + . . . +VS 9 „(2 rt +0, 



(2.9) 



standard algorithms and software that determine the mini- 
mizing (q 0 , q l9 . . . , q„) recursi vely. Mos t generic ^cpde s for 
convex optimization wiU^gij^ 
derivative) and/or Hessian (second derivative) for use in a 
recursive algorithm, such as defined, for example, by New- 
ton's method. However, for the specific problem of mini- 
mizing <j>, both the gradient and the Hessian can be computed 
directly, without computing the values of the function (2.8), 
using the computation of the covariances of an associated 
process and using Fourier transforms. While the covariance 
data are well known to be computable using just recursive 
algorithms employing only ordinary arithmetic operations 
(see, Porat, Digital Processing of Random Signals, supra, 
especially at 175-177), we disclose here that the computa- 



and log is the natural logarithm, minimize the objective 65 tion of the Fourier coefficients can be computed using 
function (2.8) over all choice of variables (q 0 ,q 3 , ■ ■ . , q„) recursive algorithms employing only ordinary arithmetic 
such that operations. For this reason, a direct application of Newton's 
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method with the computations of the gradient and Hessian In fact, Newton's method amounts to recursively updating 

described below gives an algorithm believed to be the best the vector q with components q 0 , q 19 . . . , q„ according to 

state of the art. The resulting recursive minimization the rule 
algorithm, using only arithmetic operations, has been 

encoded in MATLAB and can be implemented with an 5 

MAC mathematics processing chip using ordinary skill in fl^W+M** (2.22) 

^More recisel the adient is iven b where d* is the search direction (2.21) at the point q* (see, 

ore precise y, e gra len is given y e ^ M jyj moux> Mathematical Programming: Theory and 

Algorithms, John Wiley and Sons, New York, 1986, at 

— (q 0 , qi q n ) = c k (215) 10 94-95). Here the step size X k is chosen so that 

dqk (i) q* +1 satisfies the constraints (2.10), 

(ii) Wolfs Test (see, D. G. Luenberger, Linear and 

where Nonlinear Programming (Second Edition), Addison- 

15 Wesley Publishing Company, Reading, Mass., 1984 at 



■si> 



\B(e^ 9 )\ 2 (2.16) 214.), which amounts to checking that 

dB for it =0, 1,2 n 



(c-^ +1 )'<f fc *(l-6)(e-c*)7f fc 



are the covariances corresponding to a process with spectral 2 o 

density is fulfilled for some specified 0<6< 1 /4, where c* denotes 

c in step k and ' denotes transpose, 

\B(e* 9 f 00 (2.17) where test (ii) is optional. Take 7^=1 as long as these 

jp- = c 0 + 2^] c k cos(k0). conditions are satisfied; otherwise successively take half as 

4=1 25 long a step until conditions (i) and (ii) are satisfied. Test 

(2.10) in a preselected number of points on the interval [-Jt, 

JiV^aM thus the difference between the given partial *) andA) / the 2™*™? test performed in conjunc- 

Wariance sequence and the partial covariance sequence ll 0 OD ™ J determining c (see Appendix A). The initial point 

corresponding to the choice of variables q 0 , q a , . . . , q„ at q caa be obtamed via 

which the gradient is calculated. The minimum is attained 30 

when this difference is zero. The covariances c 0 , c" lf . . . , c„ lAV^-V+fi 0 cose + q 2 ° cos28 + . . . -k^ 0 cos ne, (2.23) 
can be determined, via ordinary arithmetic operations, by 

first performing the factorization (2.12) by one of the where A°(z) is an arbitrary stable polynomial. For A°(z) one 

procedures disclosed in Appendix A and then applying the could choose the polynomial A(z) obtained by the LPC 

procedure described in Appendix B to A(z). 35 (maximum entropy) procedure, i.e., the Szego polynomial, 

To implement Newton's method, the Hessian of (2.8) is generated by the Levinson-Durbin algorithm (see, e.g., 

also needed, i.e., the matrix function of second derivatives of Byrnes et al., A Complete Parameterization of All Positive 

(2.8), i.e., Rational Extensions of a Covariance Sequence, supra, at 

1843-44; and Porat, Digital Processing of Random Signals, 

JWM. - • ■ . 0-*(W,) WiA - . . . « (2.18) 40 su P ra ' a J 159 > a ° d *PP«>Priately normalized by (r„T in in the 

' J 1 } terminology of Byrnes et al., A Complete Parameterization 

where 0 f ^// Positive Rational Extensions of a Covariance 

Sequence, supra. However, the preferred procedure is to 
1 r* ,JS(e' 9 )| 2 (2- 19) determine the initial condition A°(z) by a procedure 

fi — for k = 0, l, 2 2* 45 d e Scr ibcd below. This is done in Box 4. 

Given c from Box 1 and B(z) from Box 2, the preferred 
recursive selection method in Box 3 is described in the flow 
and h^-h*. Moreover, h 0 , hj, . . . , h^ are the 2n+l first chart of FIG. 6. 

Fourier coefficients of the spectral representation Starting with the initial pole polynomial, in each step first 

50 determine the vector c with components c 0 , c„ . . . , c M via 
|0(e^)| 2 ^ (2 20) (2.17) in the manner described above, taking Q(e / ' e )-|A(e /e ) 

~Qfe$f = *° * 2 2-j A * C0S W' | 2 ; see Appendix B. Next, test whether the current approxi- 

4=1 mation c of the partial covariance sequence c is within the 

tolerance e of c. If it is not, continue the recursive algorithm 
so, in the same way as above, the procedure of Appendix B 55 °y updating A(z). If it is, terminate the recursive steps and 
can be used to compute r^, r^, . . . , h^. Since the Hessian determine the filter parameters (2.5) via the recursions 
is the sum of a Toeplitz matrix and a Hankel matrix, the (2.14). The updating is performed by taking a Newton step 
search direction at the point q 0 , q a , . . . , q , i.e., (222) as described above, computed from the present A(z) 

by setting Q(d°)^\A(el^. The updated A(z) polynomial is 
d'-H~Hc-c) 60 oblained b y f actori ng the pseudo polynomial Q* +1 (z) corre- 

_ ^ 2 ' 21 ^ sponding to the updated point q* +1 by the procedures of 

(where c and c are the n+1 vectors with components Cq, Appendix A, thereby also checking that condition (i) is 
c lf . . . , c„ and c 0 , c lf . . . , c„ respectively) can then be fulfilled. 

determined directly or via a fast algorithm (see, G. Heinig, Determining the Initial Condition 
P. Jankowski and K. Rost, Fast Inversion Algorithms of 65 In Box 4, we determine, from the speech data, the initial 
Toeplitz-plus-Hankel Matrices, Numerische Mathematik 52 condition for the algorithm in Box 3. An obvious choice 
(1988), 665-82). would be the pole polynomial produced using a standard 
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LPC filter design procedure. The preferred initialization 
procedure is to form the signal x 0 , x 19 x 2> ... from the speech 
signal y 0 , y l9 y 2 , . . . and the given zero polynomial (2.6) via 
the recursion 



(2.24) 



by setting x,=0 for t<0, then to compute the sample cova- 
riances 



10 



d k = V x ,x t . k k = 0, 1, 2, ... 



(2.25) 



and finally take the initial A(z) to be the LPC pole polyno- 
mial corresponding to the. x signal, computed as above but 



with c n> c, 



, c„ exchanged for d 0 , d 3 , . 



Another good alternative procedure is obtained by run- 
ning the fast algorithm disclosed in Byrnes et al., On the 
Nonlinear Dynamics of Fast Filtering Algorithms, supra, 
backwards, using the PARCOR coefficients 



for some m^n, determined from the speech data as 
described above. For k=0, 1, 2, . . . , m, determine 



-r*+«-l7t+n-2 

1-7**2 



-r*+n-l7i+n-3 — -7k+n-lVk 



-yUn-l7ic+n-3 



-7i + «-27i 
-7* +n -37i 



30 



35 



and, given the zero polynomial (2.6), compute 













h 


= G 0 G l G 2 ...C m 








V 



Next determine the Szego polynomials <j) 1 (z) <j> 2 (z), . . . , 
<|>„(z) via the Levinson algorithm (see, e.g., Byrnes et al., A 
Complete Parameterization of All Positive Rational Exten- 
sions of a Covariance Sequence, supra, at 1843) and deter- 
mine g 0 by the procedure of Appendix B applied to 

Then, the polynomial 



fo,(z)+.*i*>»-iU) + •••+*») 



14 



can be used as an initial condition. 

Finally, one could also use any of the pole estimates 
discussed in Markel et. al, Linear Prediction of Speech, 
supra, at 271-275, such as, for example, the one obtained via 
Prony's method. (Prony's method is not good enough for a 
final solution but could be used for an initial value.) 

Excitation Signal Selection 

The output of Box 5 is the choice of an excitation signal, 
drawn from a class of model signals, which when used to 



excite the LLN filter will best regenerate the observed 
signal. The design of the LLN filter is not a vocoder design, 
but rather a technology which can be used in a broad class 
of speech analysis and synthesis architectures. Various 
vocoder schemes employ various kinds of excitation signal 
selections schemes, based on a variety of classes of model 
signals. For example, for a pitch-excited vocoder, the exci- 
tation signal selection consist of a decision, for each speech 
frame, whether the sound is voiced or unvoiced, and for 
voiced frames, the determination of a pitch period. Pitch 
detection is standard in the art (see Barnwell III et ah, 
Speech Coding: A Computer Laboratory Textbook, supra, at 
101 for discussion and further references). As another 
example for the CELP vocoder approach to analysis-by- 
15 synthesis, the excitation signal generator tests which signal 
from a code book of (typically) 1024 (or 10 bit) codewords 
produces a synthetic output which best matches the speech 
signal when passed through the LPC filter. Such selection 
procedures are standard in the art (see, e.g., Barnwell III et 
20 al., Speech Coding: A Computer Laboratory Textbook, 
supra, at 101 and 129-32; W. B. Kleijn, D. J. Krasinski and 
R. H. Ketchum, Fast Methods for the CELP Speech Coding 
Algorithm, IEEE Trans. Acoustics, Speech, and Signal Pro- 
cessing 38 (1990) at 1330-32; and TR45 Full-Rate Speech 
25 Codec Computability Standard /W-2972, Electronic Indus- 
tries Association, 2001 Eye Street, N.W., Washington, D.C. 
20006, 1990 at 22-32) when applied to LPC filters and can 
also be implemented for general digital filters, including, for 
example, LLN filters. 

In summary, in the speech analyzer disclosed herein, any 
of these excitation signal selector would perform the same 
determination of the best signal, using an LLN filter in lieu 
of the use of an LPC filter. 
The Excitation Generator 

Just as various speech analysis schemes employ various 
kinds of excitation signal selection methodologies, for 
speech synthesis each vocoder employs an excitation gen- 
erator based on its speech analysis methodology. In the 
speech synthesizer disclosed herein, any of these excitation 
40 generators would perform the same regeneration when inter- 
faced with an LLN filter, in lieu of an LPC filter intercon- 
nection. For the purposes of illustration, we describe the 
mechanisms of the excitation generator for an LLN filter 
employed in a pitch-excited vocoder. The pitch and voicing 
45 information of Box 5, the gain and the identification of the 
excitation signal selected in Box 5 are all coded and trans- 
mitted to the receiver where they are decoded and constitute 
the input to Box 6, as illustrated in FIG. 7. 

Here there is a switch which is set to either voiced or 
50 unvoiced sound, according to the information obtained from 
Box 5. If the sound is voiced, a pulse train with the pitch 
period determined in Box 5 is produced, and, if the sound is 
unvoiced, white noise is produced. The signal thus produced 
is passed through a gain, the setting for which is obtained 
55 from Box 5, and delivered as an input to the LNN filter of 
Box 7. The purpose of the gain setting is to control the 
amplitude of excitation signal and thus the loudness of the 
synthetic speech (see, Barnwell III et al, Speech Coding: A 
Computer Laboratory Textbook, supra, at 87-88). The 
arrangement of these circuit elements, but not the design and 
implementation of the present invention, is standard in the 
art. 

The LNN Filter 

The LLN parameters determined in Box 3 are coded and 
transmitted to the receiver where they are decoded. The 
LNN filter of Box 7 is depicted in FIG. 4, where the 
parameters (2.5) are those transmitted from Box 3, and 



60 



65 
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where the input is that produced in Box 6. The output of Box where 
7 is digital synthetic speech which may be transformed to an 

analogue speech signal by a D/A converter, after having i j (A .2) 

gone through a de -emphasis filter, if needed. a °~ J \~+o*T + crz 

As can be appreciated by those skilled in the art, the 5 \ +<r, + — + <r H 

present invention may be readily implemented in a 

microprocessor(s) suitably programmed to perform the indi- is the required polynomial spectral factor. Moreover, setting 
cated calculations. Furthermore, references have been made YaW - /**** 
above and throughout for all of the processing steps to 
existing software or programmable hardware to fully and 10 

completely implement the present invention. The present hJ <:l for i-QAX • • • (A.3) 

invention may also be implemented on custom designed . . . e .... . 4l _ 4 , 
chips, or otherwise using presently known design and imple- 15 a teSt f ° r P ° Sltmty > Le " that ^ hol6s ' 
mentation techniques for microelectronic circuitry for hard- Appendix B. Covariance Computation Subroutine 

ware or software implementation. As such, the present 15 

invention should not be limited to any particular hardware or Tim section describes how to determine the coefficients 
software implementation and instead may be implemented go> gi> ■ • • > g« m lne expansion 
in a variety of formats to suit the particular application 

thereof. me ptf « (B.l) 

The foregoing disclosure has been provided as illustrative 20 \A{e) 9 tf =g0 + 2 2j ** cos W 

of the preferred embodiment as contemplated by the inven- * =1 
tors and should not be considered as limiting. Instead, the 

scope of the invention should only be that as provided by the for any (stable) polynomials (2.6). and (2.11). 
claims appended hereto, and their equivalents. Begin by determining the coefficients f 0 , t l9 . . . , f 2M in the 

25 expansion 

Appendix A. Factorization Subroutine 

A subroutine used repeatedly above is to find the stable 1 = / 0 + 2Y /*cos(*0) 

polynomial (2.11) satisfying (2.12) for a given pseudo \AW 9 f £f 

polynomial (2.9). This is a standard factorization problem 30 

for which there are many known procedures (see, e.g., F. L. v „ ^ . . , 

Bauer, Ein direktes Iterationsverfahren zur Hurwitz- <™« s P<™hng to a LPC filter. This is done by first applying 
Zerlegung eines Polynoms, Arch. Elek. Ubertragung, 9 the ^vcr« Uvinson algorithm (see B. Porat, Digital Pro- 
(1955), 285-290). We believe the method described in cessin Z °f Random ^gnals f supra, at 165 and 47) to A(z) for 
Vostry, Zdenek, New Algorithm for Polynomial Spectral 35 ^P utin S reflection coefficients, and then the inverse 
Factorization with Quadratic Convergence I, Kybernetika, Schur algo^ilhrn (see B. Porat, Digital Processing of Ran- 
77 (1975), 411^18) is the best available art We also 
disclose another factorization algorithm which also yields a 
positivity test. M-files software in MATLAB have been 
developed. This algorithm is recursive, using only ordinary 
arithmetic operations, and can be implemented with an 
MAC mathematics processing chip using ordinary skill in 
the art. 

For the sake of briefness, the algorithm is described by t0 fi " d J*? ' ' " " f ^ Fina11 ^ the coefficients go, 8i. 
referring to equations in Byrnes et al.,i4 Complete Param- 45 ' g " are obtamed tiom 
eterization of All Positive Rational Extensions of a Covari- 
ance Sequence, supra. First solve the Levinson algorithm gJ - Po f. + V Pl (j^ A +. f i+j ) 
(equation 2.9 in the Byrnes et al. article) merely exchanging ' ' l^t ' ' ' ' +J 
c 0 c l3 . . . , c„ for q 0 , q l5 . . . , q„, thus obtaining the reflection 
coefficients Yo> Yi» • • • » Y„_i from (equation 2.7 in the Byrnes 50 
et al. article) and the Szego polynomials <(> 1 (z), $ 2 ( z )> • • > 
(j)„(z) from (equation 2.9 in the Byrnes et al. article). Next, 

determine the coefficients a lf c^, . . . , a„ an such that z^^Po-V^- 1 )* . . . +p„(z"+r«). 

55 Consequently, the covariance coefficients go» gi» • • • , 

4> / ,(2)+a 1 <j>„. 1 + . . . +a n -z / *, can be computed using just recursive algorithms and ordi- 
nary arithmetic operations, 

which amounts to solving a triangular linear system of What is claimed is: 

equations. Using the vectors a and y, with components a a , 1. Aspeech analyzer for speech segmented into a plurality 

a 2» ■ • • > and Yo> Yi> • • • » Y„-i respectively as initial 60 of speech frames comprising: means for determining filter 

conditions, the fast algorithm (equation 2.17 in the Byrnes et zeros and covariances from a speech frame, and means for 

al. article) converges to a limit as t— », provided the calculating a set of gains for an LLN filter which match said 

positivity condition (2.10) holds. In particular, a*(t)-»o> for filter zeros and covariances so that as said LLN filter is 

k-1, 2, . . . , n. Then excited by a selected signal said speech frame is regenerated. 

65 2. The speech analyzer of claim 1 wherein said calculating 
means includes means for computing a set of filter param- 

A(z)-a 0 (i-+o 1 2- 1 + . . . +oj, (a.1) eters from the filter zeros and filter covariances. 



then the recursion 

fn+k = -V. —fn-i-* 

4-4 an 



where p 0 , p lt . . . , p n are the coefficients in 
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3. The speech analyzer of claim 2 further comprising a 19. The speech synthesizer of claim 18 further comprising 
signal selector for determining said excitation signal. a telemetry link for receiving a set of filter parameters for 

4. The speech analyzer of claim 3 further comprising programming said LLN filter and a code indicative of a 
means for determining whether said speech frame is voiced desired excitation signal. 

or unvoiced. 5 20. The speech synthesizer of claim 18 further comprising 

5. The speech analyzer of claim 2 wherein said calculating means for programming said LLN filter in response to said 
means includes means for optimizing an objective function received filter parameters and wherein said excitation gen- 
which incorporates the filter zeros and filter covariances. erator is responsive to said code. 

6. The speech analyzer of claim 5 further comprising 21. The speech synthesizer of claim 20 further comprising 
means for determining an initial condition for said set of 10 a D/A converter for converting the output of said LLN filter 
gains for said LLN filter. to an analog waveform. 

7. The speech analyzer of claim 6 wherein said optimizing 22. A method for analyzing a frame of speech comprising 
means includes means for using only algorithms which are the steps of: determining a set of filter zeros and covariances 
defined solely in terms of arithmetic operations. from said speech frame, and calculating therefrom a set of 

8. The speech analyzer of claim 2 further comprising a 15 filter parameters comprising a set of gains for an LLN filter 
speech synthesizer in communication with said speech which match said filter zeros and covariances so that as said 
analyzer, said speech synthesizer including a programmable LLN filter is excited by a preselected signal said speech 
LLN filter, and an excitation generator connected to said frame is regenerated. 

LLN filter for generating said selected signal so that as said 23. The method of claim 22 wherein the calculating step 

programmable filter is configured in accordance with said 20 includes the step of computing a set of filter poles from the 

calculated set of gains, said speech frame is regenerated by filter zeros and covariances. 

said selected signal. 24. The method of claim 23 wherein the calculating step 

9. The speech analyzer of claim 8 wherein said speech further includes the step of optimizing an objective function 
synthesizer is remote from said speech analyzer and further which incorporates the filter zeros and covariances. 
comprising a telecommunications data link therebetween. 25 25. The method of claim 24 further comprising the step of 

10. The speech analyzer of claim 8 wherein said excila- determining an initial condition for said set of gains for said 
tion generator includes means for producing signals corre- LLN filter. 

sponding to either voiced or unvoiced speech. 26. The method of claim 25 wherein the optimizing step 

11. The speech analyzer of claim 8 wherein said excitation includes the step of using only algorithms which are defined 
generator includes a look up table having a plurality of 30 solely in terms of arithmetic operations. 

entries and a corresponding code for each of said table 27. The method of claim 23 further comprising the step of 

entries so that said speech analyzer need only determine and determining a desired signal for exciting said LLN filter to 

specify a look up table entry to thereby specify a desired regenerate said speech frame. 

signal. 28. The method of claim 22 further comprising the step of 

12. The speech analyzer of claim 11 wherein said speech 35 transmitting the filter parameters and a code corresponding 
analyzer further comprises means for determining a look up to the desired excitation signal to a speech synthesizer for 
table entry corresponding to a signal which most closely regeneration of said speech frame. 

produces the speech frame when used to excite the LLN 29. A method for regenerating a frame of speech corn- 
filter, prising the steps of: programming a programmable LLN 

13. The speech analyzer of claim 1 further comprising a 40 filter, generating a pre-selected signal for exciting said LLN 
signal selector for determining said excitation signal. filter, and exciting said LLN filter with said generated signal 

14. The speech analyzer of claim 13 further comprising to thereby regenerate said frame of speech. 

means for determining whether said speech frame is voiced 30. The method of claim 29 further comprising the steps 

or unvoiced. of receiving a set of filter parameters for programming said 

15. The speech analyzer of claim 1 wherein said calcu- 45 LLN filter, programming said LLN filter in accordance 
lating means includes means for optimizing an. objective therewith, and receiving a code indicative of said generated 
function which incorporates the filter zeros and filter cova- signal. 

nances. 31. The method of claim 30 further comprising the step of 

16. The speech analyzer of claim 1 further comprising converting the output of said LLN filter to an analog 
means for determining an initial condition for said set of 50 waveform. 

gains for said LLN filter. 32. A speech analyzer including a circuit for detecting a 

17. The speech analyzer of claim 1 wherein said optimiz- frame of speech, and a translator of converting said speech 
ing means includes means for using only algorithms which frame into a set of LLN filter parameters, said LLN filter 
are defined solely in terms of arithmetic operations. parameters being suitable for reproduction of said speech 

18. A speech synthesizer having a programmable LLN 55 frame through excitation of an LLN filter configured with 
filter, and an excitation generator for generating a pre- said filter parameters. 

selected signal for exciting said LLN filter to thereby 

regenerate a desired frame of speech. ***** 
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